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We present results of a search for WH — * Ivbb production in pp collisions based on the analysis of 
1.05 fb _1 of data collected by the DO experiment at the Fermilab Tevatron, using a neural network 
for separating the signal from backgrounds. No signal-like excess is observed, and we set 95% C.L. 
upper limits on the WH production cross section multiplied by the branching ratio for H — > bb 
for Higgs boson masses between 100 and 150 GeV. For a mass of 115 GeV, we obtain an observed 
(expected) limit of 1.5 (1.4) pb, a factor of 11.4 (10.7) times larger than standard model prediction. 
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The Higgs boson is the last unobserved particle of the 
standard model (SM) . As a remnant of spontaneous elec- 
troweak symmetry breaking, it is fundamentally different 
from the other elementary particles, and its observation 
would support the hypothesis that the Higgs mechanism 
generates the masses of the weak gauge bosons and the 
charged fermions. The Higgs boson mass (mn) is not the- 
oretically predicted, but the combination of results from 
direct searches at the CERN LEP collider [l| with the 
indirect constraints from precision electroweak measure- 
ments results in a preferred range of 114.4 < m# < 190 
GeV at 95% C.L [2j. Such mass range can be probed at 
the Fermilab Tevatron collider. In this Letter, we con- 
centrate on the most sensitive production channel at the 
Tevatron for Higgs bosons of mass below 125 GeV, i.e. 
the associated production of a Higgs boson with a W 
boson. Several searches for WH production have been 
published at a center-of-mass energy of s/s = 1.96 TeV. 
Two [i, Q used subsamples (0.17 fb" 1 and 0.44 ftr 1 ) 
of the data reported in this Letter, while two others, 
from the CDF collaboration, are based on 0.32 fb _1 and 
0.95 fb _1 of integrated luminosity [1,01. 

This analysis uses 1.05 fb _1 of DO data, collected 
between April 2002 and February 2006. As in our pre- 
vious WH analyses 0, 0] , we require one high transverse 
momentum (px) lepton (e or fj,) and missing transverse 
energy fix to account for the neutrino from the W bo- 
son decay, and two jets from the decay of the Higgs boson, 
with at least one of them being identified as originating 
from a bottom (6) quark jet. We extend this data selec- 
tion by including also events with three jets and events 
with "forward" electrons detected at pseudorapidities (9j 
1 77 1 > 1.5. We also now accept the small contribution 
originating from misreconstructed ZH, in which only one 
lepton from the Z is identified. In addition we use a more 
inclusive trigger selection in the muon channel, increas- 
ing the detection efficiency from approximately 70% to 
100% [Tol, we improve the 6-jet identification using a neu- 
ral network algorithm [ll[ , and we enhance the signal to 
background discrimination using a neural network for the 
W + 2 jet events. Overall, the improvements in analy- 
sis techniques have led to an increase of about 40% in 
the sensitivity (for an equivalent luminosity) to a Higgs 
boson with mass 115 GeV, with respect to our previous 
analysis 

For the e channel, the W + jets candidate events are 
collected, with as 90% efficiency, by triggers that require 
at least one electromagnetic (EM) object in the calorime- 
ter. In the /it channel, as 90% of the candidates are col- 
lected by triggers requiring a single muon or a muon plus 
a jet, while the remaining 10% of events arc collected by 
other triggers, for a total trigger efficiency of as 100%, as 
estimated in data (lo| . 

The event selection requires one lepton candidate with 
px > 15 GeV, # T > 20 GeV [$ T >25 GeV for events with 
a forward electron), and exactly two jets with px > 25 



and 20 GeV, and \r)\ < 2.5, or exactly three jets with 
p T > 25, 20 and 20 GeV, and \r]\ < 2.5. We also require 
the scalar sum of the px of the jets to be > 60 GeV, 
the W transverse mass reconstructed from the fix 
and the lepton px to be greater than 40 GeV— 0.5x^t to 
reject multijet background, and the primary interaction 
vertex to take place within the longitudinal acceptance 
of the vertex detector. Jets are reconstructed using a 
midpoint cone algorithm [l2| with a radius of 0.5. The 
fix is calculated from energies in calorimeter cells and 
corrected for the px of identified muons. All energy cor- 
rections applied to electrons or jets are also propagated 
to the $ T - 

A central (forward) electron is required to have \r]\ < 
1.1 (1.5 < \r)\ < 2.5). To reject fake electrons originating 
mostly from instrumental effects (track-photon overlap), 
the electron candidates must satisfy two sets of identifi- 
cation ( "loose" and "tight" ) criteria [3| . The efficiencies 
of these requirements are determined from a pure sam- 
ple of Z — > e + e~ events. The differential multijet back- 
ground for every relevant distribution is then estimated 
from the loose and tight lepton samples 0, EH • The same 
statistical method is used for muons, but with different 
loose/tight definitions. Muons are reconstructed using 
information from the outer muon detector and the central 
tracker, and must have \r]\ < 2.0. To reject muons origi- 
nating from semi-leptonic decays of heavy- flavor hadrons, 
we exploit the fact that they have lower px than those 
originating from W decay, and are generally not isolated 
because of accompanying jet fragments. The loose isola- 
tion criterion is thus defined by specifying a spatial sep- 
aration between a muon and the closest jet in the rj—tp 
plane of AR = -J (Ai]) 2 + (Aip) 2 > 0.5, where ip is the 
azimuthal angle. Tighter isolation is defined by requir- 
ing little tracking and calorimetric activity around the 
muon track. 

The dominant backgrounds to WH production are 
from H^+heavy flavor jets production, top quark pair 
production (tt), and single top quark production. Signal 
(WH and ZH) and diboson processes (WW, WZ, ZZ) 
are simulated using the PYTHIA [3] event generator, 
and CTEQ6L [l5| leading-order parton distribution func- 
tions. "W+jets" events refer to W bosons produced 
in association with light-flavor jets (originating from u, 
d, s quarks or gluons) or charm jets (originating from 
c quarks), and constitute the dominant background be- 
fore 6-jet identification. Wcc and Wbb are simulated 
individually and associated as "Wbb" for purposes of 
accounting. These W boson processes are generated 
with ALPGEN [l|| interfaced to pythia for showering 
and fragmentation, since ALPGEN provides a more com- 
plete simulation of processes with high jet multiplicities 
than pythia. The it and Z+jets, events are also gen- 
erated using ALPGEn/pythia. The production of single 
top quarks is simulated with COMPHEP [l3] • 

The simulated backgrounds are normalized to their re- 
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spective NLO theoretical cross sections, with the excep- 
tion of the W+ jets and W+ heavy-flavor samples, which 
are normalized to data after subtraction of all the other 
backgrounds, before 6-jet identification. All generated 
events are processed through the DO detector simula- 
tion based on GEANT [18j| . Data collected with a ran- 
dom bunch crossing trigger are overlaid on the simulated 
events to model the occupancy of the detector which is 
dependent on the instantaneous luminosity. The result- 
ing events are then passed through the reconstruction 
software. Finally, corrections are applied to account for 
the trigger efficiency and for residual discrepancies be- 
tween the data and the simulation. 

We use a neural network 6-tagging (NN&) algo- 



rithm ll| to identify heavy-flavor jets. Its requirements 
are optimized for the best sensitivity to the Higgs bo- 
son signal. For each jet multiplicity, we form two sta- 
tistically independent samples, one (2 6-tag) with two 
6-tagged jets using a loose NN& criterion resulting in a 
6-jet efficiency of 59% and a light-jet tagging (mistag) 
probability of 1.7%, and a second (1 6-tag) with exactly 
one 6-tagged jet using a tighter NN& criterion (48% effi- 
ciency and 0.5% mistag probability). All efficiencies are 
determined for jets satisfying minimum requirements in 
terms of track quality and multiplicity ("taggable jets"), 
which constitute « 80% of all jets. In the simulations, 
the 6-tagged jets are weighted to reproduce the tagging 
rate measured in data samples. 

Using these selection criteria, we observe 885 (385) in 
the 1 6-tag W + 2 jet (W + 3 jet) samples and 136 (122) 
events in the corresponding 2 6-tag sample. Distributions 
of the dijet invariant mass, using the two jets of highest 
Pt, in W+2 jet and W^+3 jet events are shown for the 1 6- 
tag and 2 6-tag samples in Fig.QJa-d). The data are well 
described by the sum of the simulated SM processes and 
multijet background. The expected contributions from a 
Higgs boson with m# = 115 GeV are also shown. The 
expected event yields for the backgrounds and a Higgs 
boson with run = 115 GeV are compared to the observed 
number of events in Table |TJ 

Although the dijet invariant mass is a powerful variable 
for separating a Higgs boson signal from background [1] , 
the sensitivity of the analysis is enhanced through the 
use of multivariate techniques: in W+2 jet events, a 
neural network is trained on simulated signal and Wbb 
events, using seven kinematic variables: px of the highest 
and second-highest pt jet, Ai?(jeti,jet2), At/?(jeti,jet2), 
Pt (dijet system), dijet invariant mass, and priW boson 
candidate). The training is performed for every simu- 
lated Higgs signal (different test masses), and separately 
for e, [i, 1 6-tag and 2 6-tag events. The resulting neural 
networks are then applied to W + 2 jet data and to the 
background and simulated signal samples. In the final 
limit-setting procedure, the distributions of the neural 
network discriminant corresponding to a specific Higgs 
boson test mass are used for analyzing the W + 2 jet 



events. The improvement in sensitivity over just using 
the dijet invariant mass is about 15% at run — 115 GeV. 
The resulting neural network discriminants are shown in 
Fig. [1Je,f). For the W + 3 jet samples, whose dominant 
background is it, the limits are determined directly from 
the dijet mass distributions. 

Systematic uncertainties on efficiencies and from the 
propagation of other systematics (e.g. energy calibra- 
tion and detector response) are: (3-5)% for trigger effi- 
ciency; (4-5)% for lepton identification efficiency; 6% for 
jet identification efficiency and jet resolution; 5% from 
the modeling of the jet multiplicity spectrum; 3% due to 
the uncertainty in the jet energy calibration; 2-10% due 
to the uncertainty in modeling W+jets, determined by 
comparing data and expectation before 6-tagging and be- 
fore reweighting the W+ jet samples to match the data 
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FIG. 1: Dijet mass distributions for the W + 2 jet, W + 3 jet 
1 6-tag events (a,c) and 2 6-tag (b,d) events. The data are 
compared to the background prediction. The distributions in 
the neural network discriminant for W + 2 jet 1 6-tag and 2 6- 
tag events are shown in (e,f), respectively. The expectation 
from WH(xl0) production for run = 115 GeV is overlaid 
(color online). 
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TABLE I: Summary of event yields for the £ (e and /i) + 
b-tagged jets + ]£t final state. Events in data are compared 
with the expected number of 1 b-tag and 2 b-tag events in the 
W + 2 and IF + 3 jet samples, in simulated samples of diboson 
(labelled "WZ" in the table), W/Z+bb or cc_("Wbb"), W/Z+ 
light quark jets ('W+jets"), top quark ("tt" and "single t") 
production, and multijet background ("m-jet") determined 
from data (see text) . The WH expectation is given for rau = 
115 GeV, and not included in the "Total" SM expectation. 



TABLE II: Observed and expected 95% C.L. upper limits 
on the cross section times branching fraction (a x B) in pb, 
where B = B(H — + bb), for different Higgs boson mass values; 
the corresponding ratios to the predicted SM cross section are 
also given. 







[GeVj 


100 


105 


110 


115 


120 


125 


130 


135 


140 


145 


150 


cxp 
obs 


a x B 
a x B 


1.66 
2.07 


1.53 
2.08 


1.44 
1.80 


1.36 
1.46 


1.25 
1.54 


1.19 
1.21 


1.13 
1.15 


1.16 
1.12 


1.09 
1.46 


1.01 
1.10 


1.01 
0.95 



cxp. ratio 7.3 8.0 9.2 10.7 12.3 15.1 19.1 27.3 37.4 53.5 90.2 
obs. ratio 9.1 11.0 11.5 11.4 15.1 15.3 19.5 26.4 50.1 58.2 83.9 





W + 


2 jet 


W + 2 jet 


W + 3 jet 


W + 3 jet 




lb-tag 


2 b-tag 


1 b-tag 


2 b-tag 


WH 


2.8 ± 


0.3 


1.5 ± 0.2 


0.7 ± 0.1 


0.4 ± 0.1 


WZ 


34.5 ± 


3.7 


5.3 ± 0.6 


9.1 ± 1.0 


1.7 ± 0.2 


Wbb 


268 ± 


67 


54 ± 14 


87 ± 22 


22.7 ± 5.7 


W+jets 


347 ± 


87 


14.0 ± 4.4 


96 ± 24 


8.5 ± 2.7 


ti 


95 ± 


17 


37.4 ± 7.0 


156 ± 29 


81 ± 15 


single t 


49.4 ± 


9.0 


12.4 ± 2.3 


15.7 ± 2.9 


6.7 ± 1.2 


m-jet 


104 ± 


29 


8.9 ± 2.1 


54 ± 15 


8.7 ± 2.1 


Total 


896 ± 


177 


132 ± 27 


418 ± 76 


129 ± 24 


Data 


885 




136 


385 


122 



(the effect of this uncertainty on the shape of the neu- 
ral network discriminant is also taken into account); 3% 
for jet taggability; and 2% uncertainty for 6-tagging effi- 
ciency. For light quark jets, the uncertainty on the mistag 
rate is 15%. The multijet background, determined from 
data, has an uncertainty of 18-38%. The systematic un- 
certainty on the theoretical cross section for the simu- 
lated backgrounds is 6-20%, depending on the process. 
The uncertainty on the luminosity is 6% [l9j |. 

We use the CLs method 0, [2l| to assess the compat- 
ibility of data with the presence of a Higgs signal. In 
the absence of any significant enhancement, we obtain 
upper limits on WH production, using the neural net- 
work output (dijet invariant mass of the bb system) for 
the W + 2 jet (W+ 3 jet) sample as the final discrimi- 
nating variable. The 1 6-tag and 2 6-tag, and the e and 
fi channels, are treated separately, giving a total of eight 
analyses, which are then combined [4j. We incorporate 
sytematic uncertainties on signal and background expec- 
tations using Gaussian sampling and include correlations 
among the uncertainties across the analysis channels. We 
reduce the impact of sytematic uncertainties using the 
profile likelihood technique [2lj |. 

The combined upper limits obtained at the 95% C.L. 
on <r(pp — > WH) x B(H — > bb) are displayed in Fig.[2]and 
given in Table HH together with the ratios of these limits 
to the predicted SM cross section. For this analysis, all 
deviations between observed and expected limits are less 
than 1.5 standard deviations. At m# = 115 GeV, the 
observed (expected) limits are 1.5 (1.4) pb, or a factor 
of 11.4 (10.7) times higher than the SM prediction. Our 
new limits are displayed in Fig. [5] and compared to the 
expected limit from our previous analysis [4]. The im- 



provement in sensitivity is significant, and our expected 
limits scale approximately inversely with luminosity com- 
pared to our previous result. These limits are the most 
stringent to date in this process at a hadron collider. 

In summary, we have presented 95% C.L. upper limits 
on the product of WH — > Ivbb production cross section 
and branching fraction for H — > bb. These range between 
2.1 and 1.0 pb for 100 < ran < 150 GeV, while the corre- 
sponding SM predictions range from 0.23 to 0.01 pb. The 
sensitivity should increase significantly in the near fu- 
ture with the continuing accumulation of luminosity from 
the Tevatron and improvement in analysis techniques. A 
significant sensitivity gain has already been achieved by 
combining these data, with other Higgs boson searches 
done by the CDF and DO collaborations 22 1. 
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FIG. 2: 95% C.L. cross section upper limit (and corre- 
sponding expected limit) on u(pp — * WH) x B(H — > bb) 
vs. Higgs boson mass, compared to the SM expectation and 
to the expected limit from our previous analysis Q). Recent 
CDF results [f| are also shown. Solid (dashed) lines represent 
observed (expected) limits. The contribution of ZH recon- 
structed in the same final state is taken into account in the 
WH signal when deriving the limits, assuming the SM ratio 
of ZH/WH cross sections. 
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